J 



Europaisches Patentamt 

® 0))) Eur °P 68n Pat «"t Office ©Publication number: 0 091 527 

Office europeen des brevets A2 



© EUROPEAN PATENT APPLICATION 

© Application number: B2306643.6 © IntCl. 3 : C 12 N 15/00 

© Oateof filing: 13.12.82 " £ 12 P 21/02 C 12 N I 1/20 

w C 07 C 103/52, C 07 H 21/04 

A 61 K 37/02 

//C12R1/1 9, C1 2R1/38, C12R1/07 



© Priority: 14.12.81 US 330912 



@ Date of publication of application: 
19.10.83 Bulletin 83/42 

© Designated Contracting States: 

AT BE CH DE FR GB IT U LU NL SE 



© Applicant: PRESIDENT AND FELLOWS OF HARVARD 
COLLEGE 
17 Qulncy Street 

Cambridge Massachusetts 02138(US) 

® Inventor. Gilbert Walter 
107 Upland Road 

Cambridge Massachusetts 0214O(US) 

© Inventor: Philipp, Barbara Wattner 
32RockledgeRoad 
Newton Massachusetts 021 61 (US) 

© Representative: Bannerman. David Gardner et al 
Withers & Rogers 4 Dyef a Buildings Holbom 
London, EC1N2JTIGB) 



© DNA sequences, recombinant DNA molecules and processes for producing human serum albumln-IIke polypeptides. 
© DNA sequences, recombinant DNA molecules and pro- 
cesses for producing human serum albumin-like polypep- 
tides. The DNA sequences and recombinant DNA molecules 
of this invention are characterized in that they include DNA 
fragments that code for human serum elbumin-like polypep- 
tides. These DNA sequences end recombinant DNA mole- 
cules end the hosts transformed with them may be em- 
ployed in the processes of this invention to produce human 
serum albumin-like polypeptides. 



< 
CM 

in 



o 



0. 
LU 



C/oydon printing Company Ltd. 



BEST AVAILABLE COPY 



0091 527 



B19 CIP 

DGB/JEA " l " 



DNA SEQUENCES, RECOMBINANT DNA MOLECULES 
AND PROCESSES FOR PRODUCING HUMAN SERUM 
ALBUMIN-LIKE POLYPEPTIDES 

BACKGROUND OF TEE INVENTION 

5 This invention relates to DNA sequences, recom- 

binant DNA molecules and processes for producing human 
serum albumin-like polypeptides. More particularly, the 
invention relates to DNA sequences and recombinant DNA 
molecules expressed in appropriate host organisms, 
10 The DNA sequences and recombinant DNA molecules 

of this invention are characterized in that they include 
fragments that code for human serum albumin-like polypep- 
tides. Accordingly, these DNA sequences and recombinant 
DNA molecules and the hosts transformed with them may be 
15 employed in the processes of this invention to produce 
human serum albumin-like polypeptides. 

Human serum albumin is a major protein component 
of human serum. It is synthesized by the liver. It appears 
to control the osmotic pressure of the intravascular fluid 
20 and to bind a variety of metabolites in the blood. The 

amino acid sequence of natural human serum albumin is known 
[B. Meloun et al., FEBS Letters , 58, pp. 134-37 (October 
1975)]. It is a protein of 585 amino acids. 

Today, human serum albumin is prepared by isolat- 
25 ing it from blood samples that are no longer useful for 
transfusions. It has found widespread application as a 
blood supplement for burn patients, as a means to improve 
the performance of the human circulatory system, as an 



0091 527 

-2- 

amino acid source for food additives, as a potential ni- 
trogen fixer, in the treatment of kernicterus (excess of 
bilirubin) in newborn infants and as a pharmaceutically- 
acceptable carrier for many drugs and other compounds 
5 employed for human therapy. 

Since human serum albumin is purified from blood 
samples, it is susceptible to the same contamination as 
blood. For example, human serum albumin may be contami- 
nated with hepatitis B virus particles or other viral and 
* 10 toxic substances that may be transmitted among humans by 
blood transfusion. Plainly, the possibility of such con- 
tamination has limited the use of human serum albumin. 
Moreover, since its major source is about- to-be-discarded 
blood samples, the supply of human serum albumin will 

15 decrease as blood storage conditions improve. A similar 
decrease in the availability of human serum albumin will 
also occur as human serum albumin finds more widespread 
use in pharmaceutical therapy. Accordingly, other sources 
of highly-purified human serum albumin are required. 

20 Recent advances in molecular biology have made 

it possible to produce large amounts of eukaryotic pro- 
teins in bacterial hosts. These include, for example, 
leukocyte interferon (S. Nagata et al., "Synthesis In 
E - coli of A Polypeptide With Human Leukocyte Interferon 

25 Activity", Nature , 284, pp. 316-20 (1980)), antigens of 
human hepatitis B virus (C. J. Burrell et al., "Expression 
ln Escherichia coli Of Hepatitis B Virus DNA Sequences 
Cloned In Plasmid pBR322", Nature , 279, pp. 43-7 (1979) 
and M. Pasek et al., "Hepatitis B Virus Genes And Their 

30 Expression In E. coli ". Nature , 282, pp. 575-79 (1979)), 
SV40t antigen (T. M. Roberts et al., "Synthesis Of Simian 
Virus 40t Antigen In Escherichia coli ". Proc. Natl. Acad. 
Sci. USA, 76, pp. 5596-5600 (1979)), and FMD viral antigens 
(H. Ktipper et al., "Cloning Of cDNA Of Major Antigen Of 

35 Foot And Mouth Disease Virus And Expression In E. coli ". 
Nature, 289, pp. 555-59 (1982)). 

In general, these processes rely on the construc- 
tion of recombinant DNA molecules characterized by a DNA 
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sequence coding for the desired product operatively linked 
to an expression control seguence. Appropriate hosts are 
then transformed with these molecules to permit production 
of the desired product by fermentation processes. For 
5 DNA coding sequences, other than those prepared via chemi- 
cal synthesis, the construction of such recombinant DNA 
molecules comprises the steps of producing a single- 
stranded DNA copy (cDNA) of a messenger RNA (mRNA) tem- 
plate for the desired protein; converting the cDNA to 

10 double-stranded DNA and operatively linking the DNA to an 
appropriate expression control seguence in an appropriate 
cloning vehicle. The recombinant DNA molecule is then 
employed to transform an appropriate host. Such trans- 
formation may permit that host to produce the desired 

15 protein when it is fermented under appropriate conditions. 

SUMMARY OF THE INVENTION 

The present invention provides at least one DNA 
sequence coding for a human serum albumin-like polypeptide. 
More particularly, we provide in accordance with this 

20 invention a DNA sequence characterized in that at least a 
portion thereof codes for a human serum albumin-like poly- 
peptide. The DNA sequences of this invention are selected 
from the group consisting of (a) HSA/33-1, ESA/17-3, 
HSA/33-l(B2lII-EcoRI)-HSA/17-3(B3lII-EcoRI) # HSA/33-1 

25 (?aaI-EcoRI)-HSA/17-3(2aaI-EcoRI) (b) DNA sequences which 
hybridize to any of the foregoing DNA sequences and which 
code for a human serum albumin-like polypeptide; (c) DNA 
sequences, from whatever source obtained including natural, 
synthetic or semisynthetic sources, related by mutation, 

30 including single or multiple, base substitutions, dele- 
tions, insertions and inversions to any of the foregoing 
DNA sequences and which code for a human serum albumin- 
like polypeptide and (d) DNA sequences comprising sequences 
of codons which code for a polypeptide containing an amino 

35 acid sequence similar to those coded for by the codons of 
any of the foregoing DNA sequences and which code for human 
serum albumin-like polypeptide. 
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By virtue of this invention, it is accordingly 
possible to obtain human serum albumin-like polypeptides 
in substantial quantities and in a form that cannot pos- 
sibly be contaminated with hepatitis B virus particles or 
5 other impurities formerly inherent in natural human serum 
albumin isolated from human blood and plasma. The DNA 
seguences f recombinant DNA molecules, hosts and processes 
of using them and the human serum albumin-like polypeptides 
produced by them in this invention avoid the problems which 
10 have beset the other known methods of human serum albumin 
production. Accordingly, they enable large amounts of 
highly-pure human serum albumin-like polypeptides and their 
derivatives to be made available for diverse -uses in the 
pharmaceutical and other industries. 

15 BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic outline of one embodi- 
ment of a process of this invention for producing the human 
serum albumin-like polypeptides of this invention. 

Figure 2 is a partial restriction map of three 
20 DNA sequences and recombinant DNA molecules of this inven- 
tion: pKT218(HSA/33-l), pKT218 (HSA/17-3 ) and pKT218 
(ESA/33-l(BglII-EcoRI)-HSA/17-3(BglII-EcoRI)) and displays 
the combination of the first two sequences to produce the 
third. 

25 Figure 3 displays a partial restriction map of 

DNA sequence HSA/33 -1 ( Bgl I I -EcoRI ) -HSA/17-3 ( Bgl 1 1 -EcoRI ) 
and the strategy employed to determine portions of its 
nucleotide sequence. The distances indictated are approxi- 
mate. They may be confirmed by nucleotide sequencing. 

30 Figure 4 displays portions of the nucleotide 

sequence of HS A/3 3 -1 ( Bgl 1 1 -EcoRI ) -HSA/17-3 ( Bgl 1 1 -EcoRI ) 
and the amino acid sequence of the coding regions of those 
portions . 

Figure 5 displays a schematic outline of a process 
35 of preparing another recombinant DNA molecule of this inven- 
tion. 
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Figure 6 displays a schematic outline of a pro- 
cess of preparing another recombinant DNA molecule of this 
invention. 

Figure 7 displays a schematic outline of a pro- 
5 cess for preparing anotaer recombinant DNA molecule of 
this invention. 

Figure 8 displays a schematic outline, of a pro- 
cess for preparing another recombinant DNA molecule of 
this invention. 

10 DETAILED DESCRIPTION OF THE INVENTION 

In order that the invention herein described 
may be more fully understood, the following detailed de- 
scription is set forth. 

In the description the following terms are 

15 employed: 

Nucleotide - A monomeric unit of DNA or RNA 
consisting of a sugar moiety (pentose), a phosphate, and 
a nitrogenous heterocyclic base. The base is linked to 
the sugar moiety via the glycosidic carbon (l 1 carbon of 

20 the pentose) and that combination of base and sugar is 
called a nucleoside. The base characterizes the nucleo- 
tide. The four DNA bases are adenine ( M A U ), guanine 
("G"), cytosine ("C"), and thymine ("T"). The four RNA 
bases are A, G, C and uracil ("U"). 

25 DNA Sequence - A linear array of nucleotides 

connected one to the other by phosphodiester bonds between 
the 3 1 and 5 f carbons of adjacent pentoses. 

Codon - A DNAseguence of three nucleotides (a 
triplet) which encodes through mRNA an aaiino acid, a trans- 

30 lation start signal or a translation termination signal. 
For example, the nucleotide triplets TTA, TTG, CTT, CTC r 
CTA and CTG encode for the amino acid leucine ("Leu"), 
TAG, TAA and TGA are translation stop signals and ATG is 
a translation start signal. 

25 Reading Frame - The grouping of codons during 

translation of mRNA into amino acid sequences. During 
translation the proper reading frame must be maintained. 
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For example, the sequence GCTGGTTGTAAG may be translated 
in three reading frames or phases, each of which affords 
a different amino acid sequence 

GCT GGT TGT AAG — Ala-Gly-Cys-Lys 
5 G CTG GTT GTA AG — Leu-Val-Val 

GC TGG TTG TAA G — Trp-Leu-(STOP) 
Polypeptide - A linear array of amino acids con- 
nected one to the other by peptide bonds between the 
a -amino and carboxy groups of adjacent amino acids, 
10 Genome - The entire DNA of a cell or a virus. 

It includes inter alia the, structural genes coding for 
the polypeptides of the cell or virus, as well as its 
operator, promoter and ribosome binding and interaction 
sequences, including sequences such as the Shine-Dalgarno 
15 sequences. 

Structural Gene - A DNA sequence which encodes 
through its template or messenger RNA ( "mRNA" ) a sequence 
of amino acids characteristic of a specific polypeptide. 

Transcription - The process of producing mRNA 
20 from a structural gene. 

Translation - The process of producing a poly- 
peptide from mRNA. 

Expression - The process undergone by a struc- 
tural gene to produce a polypeptide. It is a combination 
25 of transcription and translation. 

Plasmid - A non-chromosomal double-stranded DNA 
sequence comprising an intact "replicon" such that the 
plasmid is replicated in a host cell. When the plasmid 
is placed within a unicellular organism, the characteris- 
30 tics of that organism may be changed or transformed as a 
result of the DNA of the plasmid. For example, a plasmid 
carrying the gene for tetracycline resistance (Tet R ) 
transforms a cell previously sensitive to tetracycline 
into one which is resistant to it. A cell transformed by 
35 a plasmid is called a "transforroant" . 

Phage or Bacteriophage - Bacterial virus, many 
of which consist of DNA sequences encapsidated in a protein 
envelope or coat ("capsid protein"). 
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Cloning Vehicle - A plasmid, phage DNA or other 
DNA sequence which is able to replicate in a host cell, 
which is characterized by one or a small number of endo- 
nuclease recognition sites at which such DNA sequences 
5 may be cut in a determinable fashion without attendant 
loss of an essential biological function of the DNA, e.g., 
replication, production of coat proteins or loss of 
promoter or binding sites, and which contains a marker 
suitable for use in the identification of transformed cells, 

10 e.g., tetracycline resistance or ampicillin resistance. 
A cloning vehicle is often called a vector. 

Cloning - The process of obtaining a population 
of organisms or DNA sequences derived from one such organ- 
ism or sequence by asexual reproduction. 

15 Recombinant DNA Molecule or Hybrid DNA - A mole- 

cule consisting of segments of DNA from different genomes 
which have been joined end-to-end outside of living cells 
and have the capacity to infect some host cell and be main- 
tained therein. 

20 Expression Control Sequence - A sequence of nucle- 

otides that controls and regulates expression of structural 
genes when operatively linked to those genes. They include 
the lac system, the tr£ system, the TAC system, the p-lac 
system, major operator and promoter regions of phage X, 

25 the control region of fd coat protein and other sequences 
known to control the expression of genes of prokaryotic 
or eukaryotic cells or their viruses and various combina- 
tions of them. 

Human Serum Albumin-Like Polypeptides - A poly- 

30 peptide displaying a biological or immunological activity 
of natural human serum albumin ( u HSA n ) . This polypeptide 
may contain amino acids which are not part of natural HSA 
or may contain only a portion of the amino acids of natural 
HSA. The polypeptide may also not be identical to natural 

35 HSA because the host in which it is made may lack appropriate 
enzymes which may be required to transform the host-produced 
polypeptide to the structure and substitution of natural 
HSA. 



0091527 

-8- 

Ref erring now to Figure 1, we have shown therein 
a schematic outline of one embodiment of a process for 
preparing a DNA sequence and a recombinant DNA molecule 
of this invention. 

5 EXAMPLE 

PREPARATION OF A HUMAN FETAL LIVER cDNA LIBRARY 

A human fetal liver cDNA library was prepared 
by David Kurnit of Harvard Medical School, FolyA RNA 
was isolated from human fetal liver using well-known 

10 methods. That polyA RNA was then used as a template to 
prepare single-stranded complementary DNA (cDNA) [e.g., 
A. Efstratiadis et al., "Full Length And Discrete Partial 
Reverse Transcripts Of Globin And Chorion mRNAs", Cell , 
4, pp. 367-78 (1975) and references cited therein]. The 

15 cDNA was then rendered double-stranded by conventional 
methods, tailed with dC residues arid inserted into the 
dG- tailed PstI site of pKT218 (a derivative of pBR322) 
[e.g. L. Villa-Komaroff et al., "A Bacterial Clone 
Synthesizing Proinsulin" , Proc. Natl. Acad. Sci. USA , 75, 

20 pp. 3727-31 (1978); K. Talmadge et al., "Eukaryotic Signal 
Sequence Transports Insulin Antigen In Escherichia coli ", 
Proc. Natl. Acad. Sci. USA , 77, pp. 3369-73 (1980)]. The 
resulting recombinant DNA molecules, comprising the human 
fetal liver library, were then employed to transform 

25 competent E. coli HB101 using standard procedures. 

Dr. Kurnit graciously made this library available to us. 

SCREENING OF THE HUMAN FETAL LIVER cDNA 
LIBRARY WITH MOUSE SERUM ALBUMIN cDNA 

We decided to screen the above-described human 
30 fetal liver cDNA library with mouse serum albumin cDNA 
[D. Kioussis et al., "The Evolution Of cr -Fetoprotein And 
Albumin", Journal of Biological Chemistry , 256, pp. 1960-67 
(February 25, 1981)). The basis for our approach was that 
bovine and rat serum albumins have some amino acid seguence 
35 similarity to human serum albumin [T. Peters, Jr., "Serum 
Albumin" in The Plasma Proteins (F. W. Putnam ed.) r vol. 1, 
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pp. 133-181 (1975)]. Accordingly, we postulated that mouse 
serum albumin cDNA might cross hybridize to human serum 
albumin cDNA to an extent sufficient to allow selection 
of the particular human serum albumin-related cDNA from a 
5 cDNA library containing many cDNA's unrelated to HSA. 

To screen the above-described cDNA library, we 
pooled about 10,000 colonies from the library and grew up 
a culture (10 ml, OD BS0 = 1.8) of the pooled colonies in 
2YT medium supplemented with tetracycline (20 ug/rol) at 

10 37°C. [J. H. Miller, Experiments In Molecular Genetics 
(cold Spring Harbor Laboratory, Cold Spring Harbor, 
New York) (1972)]. Because plasmid pKT218 includes the 
gene coding for tetracycline resistance, E. ccl i HB101 
which has been transformed with pKT218 (having that gene 

15 intact) will grow in medium containing that antibiotic to 
the exclusion of E. coli not so transformed. Therefore, 
growth in tetracycline-containing medium permits selection 
of hosts containing pKT218. 

The culture was centrifuged (10 min, 5000 rpm) 

20 and resuspended in 1 ml 0.5 Tris-HCl (pH8) and lysozyme 
(1.6 rag/ml) and allowed to stand for 15 min in ice. The 
cold mixture was then combined with EDTA (to 8 mM) and 
allowed to remain in ice. It was then combined with a 
mixture (0.5 ml/ml) of 20% Triton X-100 (0.15 ml), 0.5 M 

25 EDTA ( P H8) (3.75 ml), 1 M Tris-HCl ( P H7.9) (1.5 ml), H 2 0 
(4.6 ml) and allowed to stand in ice for 15 min. The lysed 
cells were centrifuged in an Eppendorf centrifuge (5 min, 
12000 rpm, 4°C) to remove the cellular debris from the 
lysed pKT2l8-based recombinant plasmids containing the 

30 inserted human fetal liver cDNA (supernatant). 

The supernatant was applied to a low melting 
agarose gel (0.9%) and the upper portion of the "super- 
coiled" band (5000-7000 nucleotides in size) was removed 
from the gel by melting the agarose at 65°C. We expected 

35 that this band would contain all the P KT218 plasmids having 
cDNA inserts from the human fetal liver library of greater 
than about 750 base pairs. The separated agarose-containing 
pKT218-based recombinant plasmids were then employed to 
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transform E. coli HB101 competent: cells in ice (20 pi of 
gel/100 pi of cells) using conventional procedures [e.g., 
S. R. Kushner, in Genetic Engineering (H.W. Boyer ed.) 
p. 17 (1978)] 

5 The cells were kept in ice for 45 min and then 

heated to 37 °C for 5 roin. At that point 2YT broth (2 ml) 
was added and the cells incubated at 37 °C with shaking 
for 45 min. The cells were then plated on 2YT-containing 
agarose plates (containing 20 vg/ml tetracycline) and grown 

10 overnight at 37°C. 

We next prepared three nitrocellulose filter 
replicates of the agarose plates and incubated them for 
5 h at 37°C. The colonies from two of the filters were 
• transferred to 2YT-containing agarose (supplemented with • 

15 100 pg/ml chloramphenicol) and incubated for 15 h at 37 °C. 
Chloramphenicol was employed to increase the copy number 
of the cDNA-containing plasmids in the host cells. The 
cells were then lysed using the methods described in 
R. E. Thayer, "An Improved Method For Detecting Foreign 

20 DNA In Plasmids Of Escherichia Coli ", Anal. Biochemistry . 

98, pp. 60-63 {1979), except that the nitrocellulose filters 
were placed on the stack of Whatman No. 1 filter disks 
for 2 min instead of 1 min and the absorption and blotting 
procedure was repeated only once instead of two more times. 

25 The prepared filters were then dried and baked at 80°C 

for 2 h. We obtained forty filters each containing 100-200 
clones/filter from this procedure. 

The filters were hybridized to a mouse serum 
albumin cDNA probe under non-stringent conditions because 

30 we believed there would be low homology between human serum 
albumin cDNA and mouse serum albumin cDNA. We prepared 
the mouse serum albumin cDNA probe for this screening by 
culturing a host containing mouse serum albumin cDNA [D. 
Kioussis, supra 1 and excising the albumin insert from the 

35 plasmid by Hind III restriction. That fragment was purfied 
on an 8% polyacrylamide gel and nick- translated in the 
presence all four o- 32 P-labelled nucleotides and polymerase 
I to a specific activity of 1.2 x 10 8 cpm/pg. We then 
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prepared the filters for hybridization by incubating them 
for 12 h at 42 °C in hybridization buffer [40% formamide, 
0.6 M Nad, 0.12 M Tris-HCl (pH8), 4 niM EDTA (hereinafter 
5 X SET) and 10% Dextran sulfate, 0.1% SDS]. 
5 For hybridization two filters were placed in a 

polyethylene bag in the presence of 2 ml of fresh hybri- 
dization buffer and about 100 pg/ml of buffer of Hinf l 
fragment of pBR322 and 1 x 10 6 cpm of the above- isolated 
cDNA probe were added. Hybridization was continued at 

10 42 °C for 18 h. The filters were then washed for several 
hours (1 X SET, 0.5% SDS) at room temperature and then 
washed at 42°C for 20 min (1 X SET). The filters were 
then developed and the positions giving the strongest 
evidence of hybridization to the labelled probe selected 

15 for further investigation. 

We used the twenty colonies, corresponding to 
the strongest hybridization-positive positions to prepare 
overnight cultures in 2YT broth at 37 °C. The cultures 
were then streaked onto 2 YT- agarose plates (containing 

20 25 pg/rol tetracycline) and replicates made, as before, 
for hybridization. Eighteen of the twenty clones were 
again strongly positive to the labelled mouse serum al- 
bumin cDNA probe in our hybridization assay. 

ISOLATION OF HUMAN SERUM ALBUMIN RELATED cDNA 

25 We prepared one ml cultures of the above-posi- 

tive 18 colonies and centrifuged those in an Eppendorf 
tube (5 min, 12000 rpm). The cultures were resuspended 
in 70 pi STET buffer (8% sucrose, 50 mM Tris-HCl (pH8), 
5mM EDTA, 5% Triton, 2 2 0), 5 pi Lysozyme (10 mg/ml) and 

30 allowed to stand at room temperature for 5 min. After 
boiling for 1 min, the lysed cells were centrifuged in an 
Eppendorf tube (5 min, 12000 rpm) and the supernatant 
combined with an equal volume of isopropanol. The iso- 
propanol mixture was cooled to -20°C for 1 h and again 

35 centrifuged (4°C, 5 min, 12000 rpm). The pellet was then 
resuspended in 40 pi H 2 0 for restriction analysis . 
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Each of the eighteen above-prepared mixtures 
contains a pKT218-human fetal liver cDNA recombinant that 
hybridizes strongly to our mouse albumin cDNA probe. Since 
this recombinant was prepared by dC/dG tailing and inserted 
5 at the PstI site of pKT218, the cDNA insert can be excised 
from each recombinant by Pst I restriction. Accordingly, 
the eighteen mixtures were restricted with Pst I under 
standard conditions and the fragments produced by the 
18 positive cultures sized on a 1.5% agarose gel. The 

10 clones containing the ten larger fragments (about 
1200-2250 base pairs each) were selected. 

We used Southern hybridization to determine the 
orientation of the inserts in the 10 clones containing 
the largest inserts. For Southern hybridization, we used 

15 a 300-nucleotide cDNA sequence that we had isolated earlier, 
essentially as described above, from a similarly prepared 
human fetal liver cDNA library, except that it was charac- 
terized by short cDNA segments. By partial nucleotide 
sequencing, we had demonstrated that this sequence encoded 

20 amino acids 319 to 416 of human serum albumin. Accordingly, 
we used that HSA-related cDNA sequence to prepare a probe 
for the 5* end of HSA by restriction with PstI (ESA has a 
PstI site at amino acid 363). This restriction resulted 
in a probe spanning amino acids 319 to 363 of ESA. We 

2 5 then nick- translated that probe substantially as described 
by S.Y. Tsai et al. # "Effect Of Estrogen On Gene Expression 
In The Chick Oviduct. Regulation Of The Ovomucoid Gene", 
Biochemistry , 17, pp. 5573-80 (1978). 

To prepare the ten mouse serum albumin "cDNA 

30 hybridization positive and larger fragment containing 
clones for hybridization to the above HSA-related probe, 
we treated the clones with various restriction enzymes: 
PvuII, Hind i II. Pst I and Pstl/Hindlll . The particular 
fragments produced by these digests together with their 

35 hydridization to the 319-363 HSA probe enabled us to 

determine the orientation of the inserts and the location 
of the 5' end of each insert. Two of the ten clones 
pKT218(BSA/33-l) and pKT218(HSA/17-3 ) f designated by their 
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positions on the filters, had the 5 f end in the correct 
orientation with respect to the direction of transcription 
% of the gene coding for pencillinase into which the frag- 
ment had been inserted in pKT2l8. The structures of these 
5 two clones are depicted in Figure 2. 

As can be seen in Figure 2, the insert of pKT2l8 
(HSA/33-1) (hereinafter designated as pcHSA/33-1) comprised 
at least a part of what we later designated to be the nu- 
cleotide sequence coding for the presequence of human pro 
10 serum albumin and presumbably the entire coding sequence 
for what we have designated human proserum albumin. The 
insert of pKT218 (HSA/17-3), (hereinafter designated as 
pcHSA/17-3) comprised about 250 nucleotides from the 3 1 
non-coding end of HSA and about the first 1600-1700 
15 nucleotides of the coding sequence for HSA (i.e., lacking 
the coding sequence for about first the 30-35 amino acids). 
Accordingly, a combination of the two inserts (HSA/33-1 
and HSA/17-3) affords a DNA sequence coding for substantial 
portions of the non-coding 3 f end and at least part of 
20 presequence, as well as the entire coding sequence of human 
proserum albumin. 

In order to construct this combination of HSA/33-1 
and HSA/17-3, we restricted each clone with Bglll and EcoRI 
and we ligated the shorter Bqlll-EcoRI fragment of pKT218 
25 (HSA/33-1) to the longer Bglll-EcoRI fragment of pKT2l8 
(HSA/17-3). This new construction pKT218 (HSA/33-1 (Ball 1- 
EcoRI)- HSA/17-3 (Bglll-EcoRI ) ("pcHSAll") is also depicted 
in Figure 2. Subsequent DNA sequencing revealed that this 
construction of pKT2l8 (HSA/33-1 (Bglll-EcoRI l-HSA/17-3 
30 (Bglll-EcoRI) caused elimination of a 36 nucleotide 

Bgl ll- Bgl ll fragment from the DNA sequence encoding ma- 
ture HSA. However, as demonstrated below, this deletion 
did not prevent that construction or other constructions 
also having that particular deletion from producing HSA-like 
35 polypeptides in hosts transformed with them. 
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EXPRESSION OF HUMAN SERUM 
ALBUMIN-LIKE POLYPEPTIDES 

We employed the recombinant DNA molecules de- 
scribed above: pKT218(HSA/33-l), pKT218 (HS A/17-3 ) and 
5 pKT218(HSA/33-l(BglII-EcoRI) - HSA/17-3 (Bglll -EcoRI ) ) ", to 
transform £. coli EB101 using the earlier-described trans- 
formation procedures. 

Cultures of these transformed organisms were 
then prepared and tested for the expression of human serum 

10 albumin-like polypeptides using a Broome-Gilbert assay 

[S. Broome and W. Gilbert, "Immunological Screening Method 
To Detect Specific Translation Products" , Proc. Natl. Acad. 
Sci. USA , 74, pp. 2746-49 (1978)], 

Anti-HSA IgG (available from Miles Laboratories) 

15 was purified using an HSA affinity column and labelled 

with 125 1 using standard techniques. Polyvinyl discs were 
then coated with anti-HSA IgG and the solid phase treated ' 
sequentially with the respective bacterial extracts lysed 
on the plates substantially as described by S. Broome and 

20 W. Gilbert, supra , and radioactively-labelled anti-HSA. 
Washing was carried out substantially as described by S. 
Broome and W. Gilbert, supra . The radioactivity of the 
solid phase was then monitored to determine the immune 
reaction. 

25 In this assay, radioactively-labelled antibody 

will bind only to sites on the solid support where a poly- 
peptide displaying an immunological property of HSA from 
the bacterial extract has been bound to the antibody of 
the solid phase. Therefore, labelled solid phase indi- 

30 cates the presence of BSA-like polypeptides in the extract. 

The results of the radioimmunoassays were as 

follows : 
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Extract Assay 

E. coli HBl01(pKT2l8(HSA/33-l)) + 

E. coli HBl01(pKT2l8(HSA/17-3)) + 

E. coli HBl01(pKT218(HSA/33-l + 
5 (BglII-EcoRI)-HSA/17-3 
(Bglll-EcoRI))) 

E. coli HBl01(pKT2l8) - (negative control) 

HSA + (positive control) 

USE OF OTHER CLONING VECTORS 
10 AND HOST ORGANISM 

It should also be understood that a wide variety 
of host/cloning vehicle combinations may be employed in . 
cloning the HSA-related cDNA prepared in accordance with 
this invention. For example, useful cloning vehicles may 

15 consist of segments of chromosomal, non-chromosomal and 
synthetic DNA seguences, such as various known derivatives 
of SV40 and known bacterial plasmids, e.g. , from E. coli 
including col El, pCRl, pBR322, pMB9 and their derivatives, 
wider host range plasmids, e.g. , RP4, and phage DNAs, e.g. , 

20 the numerous derivatives of phage X, e.g. , NM989, and other 
DNA phages, e.g. , M13 and Filamenteous single-stranded 
DNA phages and vectors derived from combinations of plasmids 
and phage DNAs such as plasmids which have been modified 
to employ phage DNA or other expression control sequences 

25 or yeast plasmids, such as the 2 \i plasmid, or derivatives 
thereof. Useful hosts may include bacterial hosts such 
as strains of E. coli , such as E. coli HB 101, E. coli . 
X1776, E. coli X2282, E. coli MRCI and strains of Pseudomonas . 
Bacillus subtilis . Bacillus stearothermophilus and other 

30 E. coli, bacilli, yeasts and other fungi, animal or plant 
hosts such as animal (including human) or plant cells in 
culture or other hosts. Of course, not all host/vector 
combinations may be equally efficient. The particular 
selection of host/cloning vehicle combination may be made 

35 by those of skill in the art after due consideration of the 
principles set forth without departing from the scope of 
this invention. For example, the selection of an appropriate 
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host is controlled by a number of factors recognized by 
the art. These include, for example, the compatibility 
with the chosen vector, the toxicity of proteins encoded 
by the hybrid plasmid, the ease of recovery of the desired 
5 protein, the expression characteristics of the vector and 
host, bio-safety and costs- A balance of these factors 
must be struck with the understanding that not all hosts 
may be equally effective for expression of a particular 
recombinant DNA molecule. 

10 Furthermore, within each specific cloning vehicle 

various sites may be selected for insertion of the HSA- 
related DNA. These sites are usually designated by the 
restriction endonuclease which cuts them. For example, 
in pBR322 the Pst I site is located in the gene for 

15 0 -lactamase, between the nucleotide triplets that code 
for amino acids 181 and 182 of that protein. This site 
was empolyed by S. Nagata et al., supra , in its synthesis 
of polypeptides displaying an immunological or biological 
activity of leukocyte interferon. One of the two Hind i! 

20 endonuclease recognition sites is between the triplets 
coding for amino acids 101 and 102 and one of the several 
Tag sites at the triplet coding for amino acid 45 of 
P -lactamase in pBR322. In similar fashion, the EcoRI site 
and the PvuII site in pBR322 lie outside any coding region, 

25 e.g., the EcoR I site being located between the genes coding 
for resistance to tetracycline and ampicillin, respectively. 
These sites are well recognized by those of skill in the 
art. It is, of course, to be understood that a cloning 
vehicle useful in this invention need not have a restriction 

30 endonuclease site for insertion of the chosen DNA fragment. 
Instead, the vehicle could be cut and joined to the fragment 
by alternative means. 

The specific vector or cloning vehicle, and in 
particular the site chosen therein for attachment of a 

35 selected DNA fragment to form a recombinant DNA molecule, 
is also determined by a variety of factors recognized by 
tlae art , e«g» / the number of sites susceptible to a 
particular restriction enzyme, the size of the protein to 
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be expressed, the susceptibility of the desired protein 
to proteolytic degradation by host cell enzymes, the con- 
' tamination of the protein to be expressed by host cell 
proteins difficult to remove during purification, the 

5 expression characteristics, such as the location of start 
and stop codons relative to the vector sequences, and other 
factors recogni2ed by those of skill in the art. The choice 
of a vector and an insertion site for a particular gene 
in that vector is determined by a balance of these factors, 

10 not* all selections being equally effective for a given 
case. 

Although several methods are known in the art 
for inserting foreign DNA into a cloning vehicle or vector 
to form a recombinant DNA molecule, the initial cloning 

15 method preferred in accordance with this invention was 

described above for pKT2l8. Of course, other known methods 
of inserting DNA sequences into cloning vehicles to form 
recombinant DNA molecules may be equally useful in this 
invention. These include, for example, dA-dl tailing, 

20 direct ligation, synthetic linkers, exonuclease and poly- 
merase-linked repair reactions followed by ligation, or 
extensions of the DNA strands with DNA polymerase and an 
appropriate single-stranded template followed by ligation. 

DNA FRAGMENT MAPPING AND 
25 NUCLEOTIDE SEQUENCE DETERMINATION 

Apart. from their use to produce HSA-like poly- 
peptides, the DNA sequences and recombinant DNA molecules 
of this invention are also useful in replication of speci- 
fic nucleotide sequences containing all or a portion of 
30 the genome of ESA. ESA DNA prepared in this way may be 
used to determine the nucleotide sequence of portions of 
the genome, particularly those portions coding for the 
active parts of HSA. From those sequences, the structure 
of the polypeptides themselves may be determined. Knowl- 
35 edge of these sequences also permits modifications to be 
made in the recombinant DNA molecules of this invention 
to improve the yield of the polypeptide produced, to 
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increase the activity of the polypeptides produced and to 
prepare derivatives of HSA. 

Recombinant DNA molecule pKT218(HSA/33-l(BglII- 
Eco RI ) -HSA/17-3 (Ball I- EcoR I ) ) , 6ince it contained substan- 
5 tial parts of the non-coding 3 1 sequence, as well as the 
complete coding sequence of what we have designated human 
proserum albumin (except for the missing 36 nucleotide 
Bgl ll- Bgl ll deletion) and at least a portion of its pre- 
sequence may be used to determine the nucleotide sequence 
10 of the HSA genome over which it extends. 

We constructed the physical map of the HSA/33-1 
(Bqlll-EcoRI )-BSA/17-3 (Bglll- EcoR I ) sequence by isolating 
the plasmid DNA, as before, and digesting the DNA with 
various restriction enzymes (New England Biolabs or BRL) 
15 in the recommended buffers by well-known procedures. The 
products of digestion were electrophoresed on 1% agarose 
gels. They were analyzed after visualization by staining 
with ethidium bromide and compared with detailed physical 
map of pBR322 [J. G. Sutcliffe, "Complete Nucleotide 
20 Sequence Of The Escherichia coli Plasmid pBR322", Cold 
Spring Harbor Symposium, 43, I, pp. 77-90 (1978)]. 

A- partial restriction map of the HSA sequence 
was constructed on the basis of these digestion patterns. 
This map is depicted in Figure 3. We then refined the 
2 5 map by sequencing the DNA inserts, substantially as de- 
scribed by A. M. Maxam and W. Gilbert, "A New Method For 
Sequencing DNA", Proc. Natl. Acad, Sci. USA , 74, pp. 560-64 
(1977). Figure 3 displays the various restriction frag- 
ments (the circles indicating the label and the arrow the 
30 direction of sequencing) and the sequencing strategy we 
employed to determine portions of the nucleotide sequence 
of pKT218 (HSA/33-l(^II-^RI)-HSA/17-3(BglII-EcoRI)). 
The relevant portions of the nucleotide sequence we ob- 
tained for the insert of PKT218 (HSA/33-1 (Ball I -EcoRI ) - 
35 HSA/17-3 (Bglll-EcoRI ) ) are depicted in Figure 4. This 
sequence does not exclude the possibility that modifica- 
tions to the sequence such as mutations, including single 
or multiple, base substitutions, deletions, insertions. 
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or inversions have not already occurred or may not be em- 
ployed subsequently to modify its expression or activity. 
It should also be understood that the remaining portions 
of the nucleotide sequence BSA/33-l(BglII-EcoRI )-HSA/17-3 
5 ( Bgl ll- Eco RI) may be determined by similar techniques. 

By comparing the polypeptides coded for by the 
various regions of the BSA/33-l( Bgl II- EcoR I )-ESA/17-3 
( Bgl ll- EcoR I) nucleotide sequence which we determined with 
the 585 amino acids determined for natural ESA [B. Meloun, 
10 supra .], it appears that the DNA sequence of that insert 
codes for sixteen amino acids (designated as amino acids 
-7 to -22 in Figure 4) of- a sequence which because of its 
hydrophobic character we have assigned as at least part 
of a putative presequence of human serum albumin and the 
15 entire amino acid sequence of a polypeptide having 591 
amino acids. On the basis of the reported amino acid 
sequence of natural ESA [Meloun, supra ] , we have desig- 
nated this larger protein human proserum albumin (it has 
six amino acids (designated as amino acids -1 to -6 in 
20 Figure 4) between the presequence and the amino acid se- 
quence reported for natural human serum albumin) . 

It also appears (from the portions of the nucleo- 
tide sequence that we have determined) that the coding 
sequence of BSA/33-l(BglII-EcoRI )-HSA/17-3 (Bglll-EcoRI ) 
25 is out of phase by a single nucleotide, as compared with 
the coding sequence of the gene coding for penicillinase 
resistance into which we have inserted it in pBR322. 
However, as a result of perhaps some internal start, hosts 
transformed with the hybrid gene (even out of phase) in 
30 pKT218 still produce HSA-like products. 

It should of course be understood that the cod- 
ing sequence of human proserum albumin, as isolated and 
described by u§, may be employed in other constructions 
to prepare plasmids where the coding sequence is in phase 
35 with the penicillinase gene or to prepare other plasmids 
to improve the yield and activity of HSA-like polypeptides 
in accordance with this invention. 
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IMPROVING THE YIELD AND ACTIVITY 
OF HSA-LIKE POLYPEPTIDES 
PRODUCED IN ACCORDANCE WITH THIS INVENTION 

The level of production of a protein in a host 
is governed by two major factors: the number of copies 
of its gene within the cell and the efficiency with which 
those gene copies are transcribed and translated. Effi- 
ciency of transcription and translation (which together 
comprise expression) is in turn dependent upon nucleotide 
sequences, normally situated ahead of the desired coding 
sequence. These nucleotide sequences or expression con- 
trol sequences define, inter alia, the location at which 
RNA polymerase interacts to initiate transcription (the 
promoter sequence) and at which ribosomes bind and inter- 
act with the mRNA (the product of transcription) to ini- 
tiate translation. Not all such expression control 
sequences function with equal efficiency. It is thus of 
advantage to separate the specific coding sequences for a 
desired protein from their adjacent nucleotide sequences 
and to fuse them instead to other known expression control 
sequences so as to favor higher levels of expression or 
perhaps higher levels of secretion and maturation from 
the host cell. This having been achieved, the newly 
engineered DNA fragment may be inserted into a multicopy 
plasmid or a bacteriophage derivative in order to increase 
the number of gene copies within the cell and thereby 
further to improve the yield of expressed protein. 

Several expression control sequences may be 
employed as described above. These include the operator, 
promoter and ribosome binding and interaction sequences 
(including sequences such as the Shine-Dalgamo sequences) 
of the lactose operon of E. coli ("the lac system" ), the 
corresponding sequences of the tryptophan synthetase system 
of £. coli ("the trj> system"), the p-lac system, the major 
operator and promoter regions of phage A (0 T P T and O-P*), 

L Li R R 

the control region of Filamenteous single stranded DNA 
phages, or other sequences which control the expression 
of genes of prokaryotic or eukaryotic cells and their 
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viruses and various combinations of those sequences, e.g. 
the TAC system or the TRC system. Therefore, to improve 
the production of a particular polypeptide in an appropriate 
host, the gene coding for that polypeptide may be selected 
5 as before and removed from the recombinant DNA molecule 

containing it and the gene reinserted into a another cloning 
vehicle or expression vector closer or in a more appro- 
priate relationship* to its former expression control 
sequence or under the control of one of the above improved 
10 expression control sequences. Such methods are known in 
the art. 

Prior to or subsequent to such constructions, 
the DNA sequences encoding the HSA-like polypeptides may 
also be combined with various signal sequences, both 

15 prokaryotic or eukaryotic in origin, or combinations 
thereof, bo as to permit the HSA-like polypeptide to be 
secreted from the host cell preferably with cleavage of 
the signal sequence during secretion* Such methods are 
described, for example, in Villa-Komaroff et al., supra , 

20 and Talmadge et al., supr a. 

Further increases in the cellular yield of the 
desired products depend upon an increase in the number of 
genes that can be utilized in the cell. This may be 
achieved, for illustration purposes, by insertion of re- 

2 5 combinant DNA molecules engineered in the way described 
previously into the temperate bacteriophage X (NM989), 
most simply by digestion of the plasmid with a restriction 
enzyme, to give a linear molecule which is then mixed with 
a restricted phage K cloning vehicle [e.g., of the type 



30 



* As used herein "relationship 11 may encompass many 
factors, e.g., the distance separating the expression 
enhancing and promoting regions of the recombinant DNA 
molecule and the inserted DNA sequence, the transcription 

35 and translation characteristics of the inserted DNA 
sequence or other sequences in the vector itself, the 
particular nucleotide sequence of the inserted DNA 
sequence and other sequences of the vector and the 
particular characteristics of the expression enhancing 

40 and promoting regions of the vector . 
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described by N. E. Murray et al., "Lambdoid Phages That 
Simplify The Recovery Of In Vitro Recombinants", Molec. 
gen. Genet, 150, pp. 53-61 (1977) and N- E. Murray et al. f 
"Molecular Cloning Of The DNA Ligase Gene From Bacterio- 
5 phage T4 M , J. Mol. Biol. , 132 , pp. 493-505 (1979)3 and 
the recombinant DNA molecule recircularized by incubation 
with DNA ligase. The desired recombinant phage is then 
selected as before and used to lysogenise a host strain 
of E. coli . 

10 Particularly useful X cloning vehicles contain 

a temperature-sensitive mutation in the repressor gene cl 
and suppressible mutations in gene S, the product of which 
is necessary for lysis of the host cell, and gene E, the 
product which is the major capsid protein of the virus. 

15 With this system the lysogenic cells are grown at 32°C 

and then heated to 45°C to induce excision of the prophage. 
Prolonged growth at 37 °C leads to high levels of production 
of the protein, which is retained within the cells, since 
these are not lysed by phage gene products in the normal 

20 way, and since the phage gene insert is not encapsidated 
it remains available for further transcription. Artificial 
lysis of the cells then releases the desired product in 
high yield. 

As another illustration, the coding sequence of 
25 a gene coding for an BSA-like polypeptide of this invention 
could be inserted into an expression vector under P L control, 
substantially as described in H. KQpper et al., "Cloning 
Of cDNA Of Major Antigen Of Foot And Mouth Disease Virus 
And Expression In E. coli ". Nature , 289 , pp. 555-59 (1981) 
30 for the genes coding for polypeptides displaying the 
antigenicity of FMDV. 

In addition, it should be understood that the 
yield of ESA-like polypeptides prepared in accordance with 
this invention may also be improved by substituting dif- 
35 ferent codons for some or all of the codons of the present 
DNA sequences. These substituted codons would code for amino 
acids similar or identical to those coded for by the codons 
replaced, but would be more favorably expressed in the 
particular host chosen for BSA production. 
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Finally f the activity of the polypeptides pro- 
duced by the recombinant DNA molecules of this invention 
may be improved by fragmenting, modifying or derivatizing 
the DNA sequences or polypeptides of this invention by 
5 well-known means, without departing from the scope of this 
invention. 

As one example of isolating a DNA sequence in 
accordance with this invention and joining it to an expres- 
sion control sequence, we isolated HS A/3 3-l( Bgl II- EcoR I) 
10 -HSA/17-3 (Bglll-EcoRI ) from the above-described vector 
and joined it downstream of the TAC promoter in another 
expression vector. 

To make this construction we restricted pKT218 
( HS A/33 -1 ( Ball I -EcoRI ) -HS A/17 -3 ( Bgll I -EcoRI ) ) , i sol ated 
15 as described previously, with EcoR I and Hind i I I as depicted 
in Figure 5. We then combined the EcoR I - Hind i 1 1 fragment 
containing the HSA coding sequences at its EcoR I terminus 
with a EcoR I - Hind i 1 1 fragment from pKKll4-ll (a gift of 
Jtirgen Brosius), containing the TAC expression control 
20 sequence. The resulting fragment (having two Hind i I I 
termini) was then recircularized by combination with the 
Hind i 1 1 fragment of pKKlO-2 (a derivative of pBR322 and a 
gift of Jtirgen Brosius). We designated this recombinant 
DNA molecule as pKT2 1 8 -TAC ( HS A/3 3 - 1 ( Bql I I -EcoRI ) -HS A/1 7 -3 
25 (Bglll-EcoRI)) or pcHSAl2. When we trans formed E. coli 
HB101 with this recombinant DNA molecule, we observed a 
• 20-30 times higher production of HSA-like polypeptides. 

To determine the location of HSA-like polypep- 
tides produced in host transformed with pKT218-TAC(HSA/33-l 
30 (BglII-EcoRl)-HSA/17-3(BolII-EcoRI)). We incubated 10 ml 
sulfur free-medium [R.B. Roberts et al. f "studies Of Bio- 
synthesis In E.coli ", Kirby Lithographic Co., Inc., p. 5 
(1957)] with 2 ml cultures of E.coli HBl01(pKT2l8-TAC(HSA/ 
33-1 ( Bql 1 1 -EcoRI ) -HS A/17 -3 < Bql 1 1 -EcoRI ) ) ) until OD = 0.2. 
35 We then induced the cultures with 5 mM IPTG and continued 
incubation to OD = 0.5. The cultures were then labelled 
with 3s S-H 2 S0 4 (5 mc i# 30 min, 37°C) and centrifuged 
(5 min, 12000 rpm). The pellets were resusp^ nded in Tris-HCl 
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(pE8), 25% sucrose, 25 \il lysozyme (10 mg/ml) and ice for 
15 min and again centrifuged (5 rain, 12000 rpra) and the 
supernatant (containing the periplasmic protein fraction 
of the transformed cells) removed. The pellets (sphero- 
5 plaets) were resuspended in 225 pi Tris-HCl (pH8), 25% 
sucrose and 725 pi Triton-lysis mix [0.15M Tris-HCl (pH8), 
0,2 M EDTA, 2% Triton X100] added. The mixture was allowed 
to stand in ice for 15 min and then centrifuged (5 min, 
12000 rpm, 4°C) to remove the cellular debris (the super- 

10 naturant containing the intracellular protein fraction of 
the transformed host cells). 

We then analyzed the periplasmic fraction-con- 
taining supernatant and the intracellular fraction-con- 
taining supernatant for the presence of HSA-like polypep- 

15 tides. We added 1000-fold excesses of anti-HSA to both 
fractions and allowed them to stand for 1 hour at 37°C. 
We then added 300 pi Sepharose Protein A (30 mg/ml), 
allowed the mixtures to stand at room temperature for 1 
hour and centrifuged them (5 min, 12000 rpm). The pellets 

20 were resuspended in 1 ml NET-NT buffer [0.65 M NaCl, 5mM 
EDTA, 50mM Tris-HCl (pH7.5), 1% Triton X100], centrifuged 
(5 min, 12000 rpm), resuspended in the same buffer, cen- 
trifuged (5 min, 12000 rpm) and resuspended in the same 
buffer, except that the buffer was 0.15 M NaCl not 0.65 M. 

2 5 We repeated this process of resuspension and centxifugation 
twice and then suspended the final pellets in 30 pi Lammld 
buffer. Analysis of the gel demonstrated the presence of 
about 50% of the HSA-like polypeptides produced by the 
transformed host in the periplasmic space fraction and 

30 about 50% of the HSA-like polypeptides in the intracellular 
fraction. 

The HSA-like polypeptide produced in this host 
appeared to be smaller than natural HSA. This observed 
difference in size to some extent may be explained by the 
35 missing 36 nucleotides in the HSA coding sequence. How- 
ever, the size difference is greater than that which could 
be accounted for by the 12 amino acids encoded by those 
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missing 36 nucleotides. The remainder of the size differ- 
ence may therefore be explained by production of the protein 
from an internal start, degradation of protein or perhaps 
incorrect processing. In any event the polypeptides dis- 
5 played an immunological activity consistent with natural 
HSA. 

Since the above construction also had the HSA 
coding sequence out of phase with the gene coding for peni- 
cillinase by a single nucleotide, we used it to prepare a 
10 construction having the correct reading frame between the 
gene coding for penicillinase and the HSA-related DNA 
fragment. As shown in Figure 6, we restricted the con- 
struction with BstEII and filled in the overlapping end 
of the fragment with a Klenow fragment. The DNA was then 
15 further restricted with EcoRI and the larger BstEIl-EcoRi 
fragment isolated. We combined this fragment with a 
fragment that we isolated from plasmid pKT234 (K. Talmadge 
et al., supra) that had been restricted with EcoRI and 
Pstl and filled in (Klenow fragment, dCTP only). The re- 
20 suiting construction has one nucleotide less between the 
portion of the gene coding for penicillinase and the HSA- 
related coding sequence. We designated this hybrid mole- 
cule pcHSAl3. Analysis of the HSA-like polypeptides 
produced by E.coli HB101 transformed with this modified 
25 hybrid gene demonstrated that about the same amount of 
HSA-like polypeptides were produced per cell as in the 
host transformed with pKT2l8-TAC(HSA/33-l(BglII-EcoRI )-HSA/ 
17-3(BglIi-EcoRI)). The level of expression is about 
4.5 x 10« molecule/cell. Moreover, the product had about 
the same size on an SDS/acryl amide gel as natural HSA. 
This was unexpected because the product was derived from 
a DNA seguence having the above described 36 nucleotide 
deletion. 

We have also prepared other expression systems 
characterized by DNA sequences encoding HSA-like polypep- 
tides. These constructions were designed to produce a 
DNA sequence encoding HSA-like polypeptides withou- the 
36-nucleotide Bglll-Bolli deletion. They were alio designed 
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to employ various combinations of expression control sequences 
and signal sequences before the HSA coding sequence. 

Referring now to Figure 7, we have depicted therein 
the construction of pcHSA30 from pKT2l8(HSA/33-l) and 
5 P KT2l8(ESA/17-3) by Xbal/EcoRI digestion and religation. 
This process avoids the 36 nucleotide deletion occasioned 
by Bol l I restriction of the HSA coding sequences* We then 
employed pcHSA30 to produce pcHSA31 and pcHSA32. These 
recombinant DNA molecules have a TAC expression control 

10 sequence derived from pGFY218 (a gift of Jtirgen Brosius). 
We transformed E.coli W3110I 2 with pcHSA32. E.coli W3110I* 2 
is a strain having a mutation in its i gene that represses 
expression of DNA sequences under the control of the TAC 
expression control sequence (e.g., the HSA coding sequence 

15 of pcHSA32) until induction by IPTG. Upon induction with 
IPTG, the transformed strain produced about 8000 molecules/ 
cell of an HSA-like polypeptide. This polypeptide had 
about the same size (SDS/acryl amide gel) as natural HSA. 

We also employed pcHSA31 and pcHSA32 to construct 

20 a recombinant DNA molecule (pcHSA36) wherein the HSA coding 
sequence is under the control of the TRC expression control 
sequence. The TRC sequence (a gift of Jiirgen Brosius) is 
a derivative of the previously described TAC system. We 
have depicted the construction of pcHSA36 in Figure 8. 

2 5 We employed pcESA36 to transform E.coli W3110I^. 

After induction as before, the transformed strain produced 
large amounts of a protein of about the same size 
(SDS/acryl amide gel) as natural HSA. Moreover, this protein 
appears primarily as a visible single band after antibody 

30 precipitation. Accordingly, it is believed that this con- 
struction, the most preferred construction of our invention, 
produces large amounts of an HSA-like polypeptide. However, 
the trans fonned host dies shortly after induction. We 
believe that this may be due to the toxic effects of the 

35 large amounts of HSA-like polypeptides produced by the 
host. Other strains and growth or induction conditions 
should permit maintenance of the strains transformed with 
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pcHSA36 and the production of large amounts of the desired 
product. 

Microorganims , DNA sequences and recombinant 
DNA molecules prepared by the processes described herein 
5 are exemplified by cultures deposited in the culture col- 
lection of the American Type Culture Collection in Rockville, 
Maryland on December 14, 1981, and identified there as 
HSA-A and assigned ATCC accession n umb er 39026. 

A: E^coli HBl01(pKT218(HSA/33-l(BglII-EcoRI )- 
10 BSA/17-3 (BglH-EcoRI ) ) ) 

Another microorganism, DNA sequence and recom- 
binant DNA molecule of this invention vas deposited in 
the culture collection of the American Type Culture 
Collection in Rockville, Maryland on Leoember 9th, 1982 

15 and identified there as HSA-B and assigned ATCC accession 
number 39253 . 

B: E.coli W3110I 2 (pcHSA 36) 
While we have hereinbefore presented a number 
of embodiments of this invention, it is apparent that our 

20 basic construction can be altered to provide other embodi- 
ments which utilize the processes and compositions of this 
invention. Therefore, it will be appreciated that the 
scope of this invention is to be defined by the claims 
appended hereto rather than by the specific embodiments 

25 which have been presented hereinbefore by way of example. 
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CVABIS : 

1." A DNA sequence characterized in that at 
least a portion thereof codes for a polypeptide display- 
ing an immunological or biological activity of human 

5 serum albumin and being selected from the group consist- 
ing of (a) HSA/33-1, HSA/17-3, HS A/33 -1 (Bglll -EcoRI ) -H5A/ 
17-3 (Bglll-EcoRI ) , HS A/3 3 -1 ( Xba I -EcoRI ) -HSA/17-3 (Xbal- 
EcoR I ) (b) DNA sequences which hybridize to any of the 
foregoing DNA sequences, (c) DNA sequences, from whatever 
10 source obtained, including natural, synthetic or semi- 
synthetic sources, related by mutation, including single 
or multiple, base substitutions, deletions, insertions 
and inversions to any of the foregoing DNA sequences, and 
(d) DNA sequences comprising sequences of codons which 

15 code for a polypeptide containing an amino acid sequence 
similar to those coded for by codons of any of the forego- 
ing DNA sequences, said DNA sequences b through d coding 
for a polypeptide displaying an immunological or biological 
activity of human serum albumin. 

20 2. A recombinant DNA molecule comprising a 

DNA sequence according to claim 1. 

3.- A recombinant DNA molecule according to 
claim 2, wherein said DNA sequence is operatively linked 
to an expression control sequence. 

25 4. A recombinant DNA- molecule according to 

claim 3 wherein the expression control sequence is selected 
from the group consisting of the E. coli lac system, the 
E. coli trp system, the E.coli p-lac system, the TAC system, 
the TRC system, the major operator and promoter regions 

30 . of phage X, the control region of Filamenteous single- 
stranded DNA phages, other sequences which control the 
expression of genes of prokaryotic or eukaryotic cells 
and their viruses and combinations thereof. 

5. A recombinant DNA molecule according to 

35 claim 4, selected from the group consisting of pKT218 

(HSA/33-1), pKT218(ESA/17-3), PKT218 ( HSA/3 3 - 1 ( Bgl I I -EcoRI ) - 
HSA/17-3 (Bgl 1 1 -EcoRI ) ) , pKT2 18 -TAC (HSA/3 3-1 (Boll I -EcoRI )- 
HSA/17-3 ( Bql I I -EcoRI ) ) pcHSA32 and pcESA36. 
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6. A host transformed with at least one recom- 
binant DNA molecule according to claim 4 or 5. 

7. A transformed host according to claim 6 
wherein the host transformed is selected from the group 

5 consisting of strains of E. coli f Pseudomonas , Bacillus 

subtilis, Bacillus stearothermophilus , other bacilli, yeasts, 
other fungi, animal and plant hosts and human tissue cells. 

8. A transformed host according to claim 6 or 7, 
selected from the group consisting of E.coli HBl01(pKT218 

10 (HSA/33-1)), E^coli HB101(pKT218(HSA/17-3)) , E.coli 

HB101 (PKT218 (HSA/33-1 (Bglll-EcoRI ) -BSA/17-3 ( Bgll I-EcoRI ) ) ), 
E.coli W31101 Q (pcHSA32) and E.coli W3110I 2 <pcHSA36). 

9. A polypeptide or fragment or derivative 
thereof displaying an immunological or biological activity 

15 of human serum albumin and produced by a transformed host 
according to any one of claims 6 to 8. 

10. A polypeptide characterized in that at least 
a portion of it is coded for by a DNA sequence according 
to claim 1. 

20 11. A method for producing a recombinant DNA 

molecule comprising the step of introducing into a cloning 
vehicle a DNA sequence according to claim 1. 

12. A method according to claim 11 further 
comprising the additional step of introducing into said 

25 cloning vehicle an expression control sequence, said expres- 
sion control sequence being introduced into said cloning 
vehicle so as to control and to regulate the expression 
of said DNA sequence. 

13. A method for transforming a host comprising 
30 the step of introducing into a host a recombinant DNA mole- 
cule according to claim 4 or 5. 

14. A method for producing a polypeptide dis- 
playing an immunological or biological activity of human 
serum albumin comprising the steps of transforming an 

35 appropriate host with a recombinant DNA molecule according 
tc laim 4 or 5; culturing said host; and collecting said 
po^ peptide. 

15. The method according to claim 14, character- 
ized in that the host transformed is selected from the 
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group consisting of strains of E. coli , Pseudomonas , 
Bacillus subtilis , Bacillus stearothermophilus , other 
bacilli, yeasts, fungi, animal or plant hosts, and human 
tissue cells. 

5 16. A method for producing a polypeptide dis- 

playing an immunological or biological activity of human 
serum albumin comprising .the steps of culturing a host 
transformed by a recombinant DNA molecule according to 
claim 4 or 5 and collecting said polypeptide, 

10 17. A process for selecting a DNA sequence cod- 

ing for a polypeptide displaying an immunological or bio- 
logical activity of human serum albumin from a group of / 
DNA sequences, comprising the step of screening the DNA 
sequences of the group to determine which' hybridize to at 

15 least one of the DNA sequences according to claim 1. 

18. The process of claim 17 wherein the DNA 
sequence screened is selected from the group consisting 
of DNA sequences from natural sources, synthetic DNA 
sequences, DNA sequences from recombinant DNA molecules 

20 and DNA sequences which are a combination of any of the 
foregoing DNA sequences. 

19. A pharmaceutically-acceptable composition 
comprising a polypeptide selected from the group consist- 
ing of the polypeptides of claim 9 or 10 and the polypep- 

25 tides produced by the methods of any one of claims 14 to 
16. , 

20. A method for treating humans comprising 
the step of treating them in a pharmaceutically-acceptable 
manner with a composition according to claim 19. 
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